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Abstract 

It is shown that in the multivariate case the orders p, of the AR part, and q, 
of the MA part, are not invariants of the time series. Thus, it is concluded 
that it only makes sense to define the class of ARMA(p,p)- irreducible 
models, where p is the biggest of the system's Kronecker indices. This 
class is shown not to be a differentiable manifold, but to contain one, 
which is a generic subset of systems with all Kronecker indices equal to 
p. A formula which gives the metric tensor for riemannian manifolds of 
linear systems as a line integral in the complex plane is introduced for 
deterministic and stochastic cases and some tensors are obtained with it. 

MSC-class: 93C05; 93A30; 93B29; 93B30 

1 Introduction 

Since the publication of Box and Jenkins book 3j, the statistical methodology pro- 
posed by these authors has been disseminated and widely used to build mathematical 
models of time series. These models are essentially discrete time differential equations 
(thus, indeed difference equations), of which the order and the parameters (supposed 
constant) are determined as a function only of a unique sample of its output se- 
quence - the time series - which is considered as a particular realization of a gaussian 
stochastic process resulting from a forced term (the input of the system) which is 
also considered as an unobservable gaussian process. The orders p of the difference 
equation and q of the input noise are identified with the aid of correlation and partial 
correlation functions. 

In the beginning of the 80's, Tiao and Tsay [16j , of the statistics department 
of Wisconsin University, proposed a methodology analogous to Box and Jenkin's 
to the multivariate case, that is, the one in which several time series are treated 
simultaneously, being viewed as a vector time series. The advantage of this approach 
is that the separate treatment of the series would not take in consideration their 
interrelations, which otherwise would allow the building of a model of greater 
predictive capacity. The proposed methodology is based upon a direct generalization 
of the correlation and partial correlation functions, to identify the orders p and q of 
the model. 

It happens, nevertheless, that there is not a method to determine isolately the 
orders p and q of a multivariate ARMA(p,q) model nor is it possible to come up 
with one, simply because, contrarily to the univariate multivariate ARMA 

model has not independent intrinsic p and q orders. This will be shown using the 
polynomial representation of linear systems and unimodular matrices. 

The necessary correction is also proposed, with the introduction of the 
ARMA(p,p)-irreducible models. Given the formal resemblance between this new 
class of models and the class of linear systems of a common McMillan degree, a nat- 
ural question is if this new class also constitutes a differentiable manifold. If this was 
the whole new set of parametrizations would be unveiled, as it happened with 

the so-called overlapping parametrizations. 

Since the space of all linear dynamical systems with a common McMillan degree 
and the generic set in the ARMA(p,p)-irreducible class are differentiable manifolds. 
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it makes sense to be able to do identification on a manifold (see references [12] and 
[T3] for the basic results on this area). The resulting path in the systems space until 
convergence will be almost independent of the parametrization This could be 
termed coordinate-free identification. 

To use this approach, the Riemannian metric tensor G must be computed, since 
the Riemannian gradient is given by G~^VJ, that is, times the gradient of a 
convenient objective function J. Furthermore, the local geometric properties of the 
manifold are defined by G and, as a consequence, some of its global properties too, 
like for instance the geodesies equations, which are obtained by the integration of a 
set of partial differential equations based upon G. 

In ^2]; pp. 149-155, a recipe is given for the obtention of G. It consists on a formula 
for the norm of tangent vectors (which are systems derivatives) as a function of the 
matrices of a state space representation, needing the solution of a Lyapunov equation; 
this norm takes straightforwardly to the metric tensor. Identification algorithms 
which require the solution of Lyapunov or Riccati equations are common in the 
literature. 

Here a formula for the metric tensor is obtained, which gives it directly as a 
functional of the system's transfer function. The generality of that formula enables 
the use of any representation besides state space (at least two other ones are known: 
ARMA and matrix pencil [21) and doesn't require the solution of Lyapunov or Riccati 
equations. Particularized formulas for ARMA and state space representations are 
obtained, using overlapping parametrizations. 

2 Definitions 

An ARMA(p,q) (Auto-Regressive Moving-Averages) model for a vector time series 
{Vt}, y G T^"^, t = 0,1,2,3..., is an equation of the type 

(1) Aoyt + Aiyt-i + ... + A^yt-p = BqEi + BiSt-i + ... + B^St-q 

where the Aj and the Bj are mxm square matrices and {et} is a gaussian white 
noise of null mean and covariance R. 

In the frequency domain (z-transform) , the model becomes 

(2) (Ao + Aiz-^ + ... + Apz-'P) Y{z) = {Bo + Biz-' + ... + B^z-^) t{z) 
where 

oo 

(3) Y{z) = Y,y^^-' 

1=0 

and 

oo 

(4) e(z) = ^5,^- 

j=0 

and z is a complex variable (if z is restricted to the unitary circle in the complex 
plane, it will have module 1 and, so, will be able to be written as z = e'^',with 1=^/^ 
and cij G 7?.; in this case, the z-transform is reduced to the discrete Fourier transform, 
which justifies the expression "frequency domain "). 
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Now, let 

(5) A{z) = Ao + A^z-^ + ... + ApZ-P 

(6) B{z) = Bq + Biz-^ + ... + B^z-'i 
Then, the frequency domain equation can be written as 

(7) Y{z)^A{z)-'B{z)e{z) 

(In the interesting cases, the nondegenerate ones, A(z) is invertible). 
The so called transfer function of the model is 

(8) H{z) = A{z)-'B{z) 
Thus, 

(9) Yiz) = H{z)e{z) 

It can be proved that, if {^tjis a gaussian stochastic process, {yt} will also be. 

As the definition of H(z) implies that its elements will be fractions whose 
numerators and denominators are polynomials in z, that is, rational functions, the 
conclusion is that ARMA models represent linear dynamical systems, that is, systems 
whose input-output relation is of the type 

oo 

(10) yt = J2^^^t-i 

i=0 



where H{z) = Hi, 



1=0 

being the Hj p X p square matrices known as the system's Markov parameters 
(or weighting sequence). A linear dynamical system is defined by its sequence {Hi} 
of Markov parameters, so that each system s can be viewed as a point (or vector) in 
the Hilbert space of these sequences, defined by: 

Sum of two systems: S3 =Si-|- S2 is the system such that H^^^^ = H^^"^^-!- H,^^^ , i 
= 0,1,2,.... 

Product of a system by a scalcir a : S2— asi is the system such that 
Hf = aHf ) , i = 0,1,2,... 

Internal product between two systems: 

00 

<s„s.> = tr j:Hl'\HtY 

li=0 

where T represents matrix trans])()sition. 

Norm of a system:||s|| = V< Si, S2 > 

Distance between two systems:d(si,S2) = ||si— S2II 

These definitions in terms of {Hj} are equivalent to the following ones in terms 
of H(z): 
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Sum of two systems: S3 =Si+ S2 is the system such that 113(2;) = Hi(2;)+ 112(2;) 

Product of a system by a scalar a : 82= asi is the system such that 
H2(z) =aHi(z) , i = 0,1,2,.. 

Internal product between two systems: 

<Si,S2 > = 2^ ^ {E^{z)El{z-^)\ z-^dz, 

where i=A/— land C is the unit circle in complex plane. 

Norm of a system:||s|| = ^/<^^^J^> 

Distance between two systems:d(si,S2) = ||si— S2II 

The system will be stable if the minimum common multiple of the denominators 
of the elements of H(z) (expressed as a rational matrix with all polynomials in z, not 
in z~^(the B operator of Box & Jenkins)) has all of its roots inside the unit circle 
in the complex plane. The system's stability is a condition that guarantees that to 
a stationary input process {st} there corresponds a stationary output process {yt} , 
stationarity here understood as time invariance of all the moments of the stochastic 
process. 

Let, now, the system's Hankel matrix be defined by 

(11) 

and search its lines top down, retaining only the ones which are linearly 
independent with the preceding ones. Associated to there are m (the number 
of components of the output) positive integers, the so called Kronecker indices of 
the system; the index n, is the number of retained lines corresponding to the i-th 
component of the vector time series {yt} . The so called McMillan degree of the 
system is the rank of 7Y, which will be denoted by n. It is clear, then, that 

m 

(12) n = 

3 Why aren't p and q invariant 

In the univariate case, m=l, A(z) e B(z) are polynomials in tT^ . Adopting the 
convention of using low case letters for scalars, the transfer function can be written 
as 



Hi H2 H3 
H-2 Hs H4 



If b(z)=p(z)c(z) and a(z)=p(z)d(z), where p(z), c(z) and d(z) are polynomials in 
z~^, the common factor p(z) can be cancelled, so that the same system (same transfer 
function) can be represented by the model 
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(14) c{z)Y{z) = d{z)e{z) 

If p was the degree of a(z), q of b(z) and r of p(z), the new model will be 
an ARMA(p-r,q-r). Thus, cancelling all the common factors of a(z) and b(z), an 
ARMA(p*,q*) with p* and q* minimal is obtained. It is obvious that any increase or 
reduction in p has, necessarily, to be accompanied by the same increase or reduction 
in q, since increases or reductions require adding or cancelling of common factors in 
the fraction h(z). The conclusion is that, in the scalar case, p* and q* are individually 
invariant, that is, given a system, it makes sense to refer to the order p* of its AR 
part and the order q* of its MA part and trying to identify them. 



In the vector case, nevertheless, this is not true. In this case, the transfer function 



IS 



(15) H{z) = A{z)-^B{z) 

If A(z)=P(z)C(z) and B(z)=P(z)D(z), with P,C e D matrix polynomials, then 

(16) H{z) = C{z)-^P{z)~'P{z)D{z) = C{z)-^D{z) 

Now, C=P~^A and D=p-^B. If C and D are to be polynomial (so that CY=De 
be an ARMA model), it is necessary that all elements of A and B be divisible by detP 
(since P~^ = -^^^Cof{P)'^). Let d(z) be the maximum common divisor of elements 
of A e B. Define 

where I is the identity matrix. Then, P=d(z)I is polynomial and the only 
polynomial matrices M such that C=M~^C and T>=M~^D are polynomial will be 
the ones whose determinant is not a polynomial in z~^ but a numerical constant; 
such polynomial matrices are known as unimodular matrices. Pairs (C,D) with such 
a property are said left coprime ^Hl- In the univariate case, after the cancelling of 
all common polynomial divisors, there remain only arbitrary numerical constants, 
which, canceled between numerator and denominator, don't change their degrees. 
In the multivariate case, there remain arbitrary unimodular matrices, which, once 
discounted, do alter the AR and MA degrees of the equation. That's why the following 
theorem can be stated: 

Theorem 3.1. In the multivariate case, the minimal p and q are not individually 
invariant. 

Proof: Let s be a system representable by an ARMA(p,q+r) model of type 
[A(z),U(z)B(z)] irreducible (that is, with A and UB left coprime), with U unimodular, 
detAp 7^ e detB^ ^ 0, where 

(18) U{z) = Uo + Uiz-^ + ... + UrZ-'' 

(19) A{z) = A^ + Aiz-^ + ... + ApZ-P 
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(20) B{z) = Bo + Biz-^ + ... + BgZ-'' 

The restrictions over the determinants imply that the product of A(z) or B(z) 
by any polynomial matrix increase their degrees (for instance, U(z)B(z)=Mo + 
Miz~^ + ...Mq^rZ~'^~^'^ , with Mg+r = UrBg ^ O, since UrBg = O would imply on 

UrB,B~' = OB^^ =>Ur=0). 

The coprimeness of (A,UB) implies on the impossibility of reducing the degrees 
of A and UB through the cancelling of a non-unimodular matrix P such that 
(A,UB) = (PM,PN). There remain only the unimodular ones. 

Now, if U is unimodular, will be polynomial (and, by the way, unimodular 
too). So, \J~^A will also be polynomial. 

The conclusion is that (U'^A, U'^UB) = {U'^A, B) will be an ARMA(p+k,q) 
model, where k=degree(U~^), irreducible, which will represent the same system. So, 
there are systems which have two irreducible ARMA representations with different 
degrees. 

Q.E.D. 

Consider, for instance, a system whose transfer function H(z) is itself a unimodular 
matrix. Then, this system has an ARMA(0,q) representation of the type 

(21) Y{z) = H{z)e{z) 
with 

(22) H{z) = Bo + Biz-^ + ... + BrZ-' 
But, in this case, H(z)~^ is also polynomial, since 

(23) = -^^cofm.r 

since det(Hz) is a number and its cofactor matrix is polynomial. 
Then, multiplying the ARMA(0,q) equation above by H(z)~^, an ARMA(q,0) 
equation is obtained: 



(24) H{z)-^Y{z) = H{zY^H{z)e{z) = e{z) 

Thus, there are two pairs, (0,q) and (q,0), of minimal orders (because nor p nor 
q can be smaller then zero) corresponding to the same system. Hannan and Deistler 
([TTj. pg. 77), although don't call attention to this phenomenon, exhibit, en passant, 
an example with q=l. 

Remark 1. Although the systems built in the proof have a minimum p* 
and a minimum q*, this is not useful, since they don't have an ARMA(p*,q*) 
representation. In the case (0,q)-(q,0) exhibited, for instance, the system doesn't have 
an ARMA(0,0) representation (save for very special cases), which would represent a 
white noise. What would be useful for model building is the joint minimality of p 
and q, which would imply on the invariance of the structure of the minimum ARMA 
model. 
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4 The ARMA(p,p)-irreducible class 

Given the impossibility of representing any system by an ARMA(p,q) model with 
p and q jointly minimal, something that can be done is to represent it by an 
ARMA(p,p), that is, with p=q, such that p be the least possible integer. It can 
be proved (see [12], pg. 38) that this minimum value of p is equal to the largest of 
the Kronecker indices of the system. 

The set of all systems representable by an ARMA(p,p) model with A{z) = 
Aq + Aiz~^ + ... + ApZ~P, with p equal to their largest Kronecker index- that is, 
with p=max{nj} - will be called ARMA(p,p)-irreducible class (the word irreducible 
here relates to the impossibility of reducing the value of p). It is a subset of the Hilbert 
space of linear systems. Some subsets of it are made up of systems with more than 
one ARMA(p,p)-irreducible model; so, the ARMA(p,p)-irreducible parametrization 
is not identifiable in the strict sense (although it is in Vajda's sense). (There are 
here, as always, two sets: the set of ARMA(p,p) models and the set of systems 
with maxjuj} = p, which are the image of those models in Hilbert's space. The 
parametrization which maps a set on the other one is said to be identifiable if it is 
biuni vocal). 

The problem posed here is if, analogously to the set of systems with a common 
Yl^i'^i 5 the set of systems with a common maxjuj} is a differentiable manifold; 
for, if that was the case, it would be possible to cover it with a set of charts, thus 
obtaining an overlapping parametrization which would be the most natural for the 
ARMA representation (in the state space representation, the natural thing is to treat 
with the set of systems with a common Yl^i ^« > which is the minimal dimension of 
the state space). To start the analysis, consider, firstly, an ARMA canonical form 
for systems with Kronecker indices n^, i=l...m. It can be obtained by the following 
procedure: 

Aline the components of the predictor y(t/t-l) (y(t) conditioned to the time series 
until time t-1), yi,y2, ■■■,ym, from t-p to p. Next, adopt the procedure indicated for 
the example below, in which m=3, rii = 2,n2 = 2,123 = 2 : 

Vl Z/2 1/3 

o o 

* 

where the leftmost matrix represents time t-2, the central one, t-1, and the right 
one, t. In the second line, the "x " indicate the components which enter in the 
selection and the "o " indicate the first time, in the search from left to right, that 
a component revealed to be linearly dependent with the precedent ones. The third 
line is the second one dislocated to the left, so as to have all L.D. components in the 
same column (the third one). 

Now consider the third column. The first component (indicated in the second line) 
is L.D. with all components two time units behind and with the first and the third 
ones one time unit behind; so, in general, there will be coefficients corresponding to 
these components when writing down the model: 

The second component depends of the first one in the same time and of all one 



(25) 



Vl Z/2 1/3 
* * * 



Vl 1/2 Vs 

* o * 

* * * 
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time unit behind, that is, 

y2{t/t - 1) = 022/1 (t/t - 1) + 622/1 (t - l/t - 2) 

+C22/2(t - l/t - 2) + C?22/3(t - l/t - 1) 

The third one depends on the first and the third ones one time unit behind and 
of all two time units behind, that is, 

ys^t/t - 1) = a3yi{t - 1/t - 2) + 63?/3(t - 1/t - 2) 
+C32/i(t - 2/t - 3) + +rf32/2(t - 2/t - 3) + e-mit - 2/t - 3) 

As to the MA part, all of its matrices are full, with the only restriction that, 
in each line of the equation, the degree of the MA part cannot be greater than the 
degree of the AR part. 



Thus, the canonical structure becomes: 



■ 1 


" 




X 


X 






X 


X 


X 


X 1 





yt + 


X 


X X 




yt-i + 











. 


1 

X 


X X 


X 


X 




XXX 


X 


X 


X 


= et + 


X 


X X 


et~ 


1 + 















X 


X X 






XXX 









Formally, what one has in the general case is (see |Hj, here modified to the 
stochastic case): 



■m ^i] m rii 

(26) yi{t + = aijkVjit + ^ " 1) + &jifc^j(^ + A; - 1) + £i(t + n^) 

j=l k=l j=l k=l 

/ rii, for i=j \ 
where njj = j minjnj + 1, rij}, for i>j 

y min{nj, rij}, for i<j J 
shifting, next, each equation in time, so that in the left member always appear 
Viit). 

The important to consider here are the following three facts: 

1) detAo = 1 always, so that one can multiply all the vector equation by Aq \ to 
obtain a monic model, that is, with Aq = I (mXm identity), without loss of generality. 

2) Only when all Kronecker indices are equal is that all matrices - except for 
Ao, which will be the identity - are full. This is, thus, the case with the greatest 
number of free parameters: 2m^p; the dimension of its image M in the Hilbert space 
of systems will be, thus, also 2m^p. It is known that M is a differentiable manyfold, 
since each set of systems with the same m (number of components of y) and same 
set of Kronecker indices is the image of one of the maps of the chart which defines 
a differentiable manyfold of dimension 2mn P , where n = Yl^i ^* (when all indices 
are equal, one has p = ni = ... = rim', thus n=mp and so 2m^p = 2mn). This is, 
consequently, the generic case of this parametrization. 

3) In all the other cases, the augmented matrix [Ap, Sp] will not be full rank (its 
product by an invertible matrix - Aq ^, for instance - thus, also not). 

Definition 1. The ARMA(p,p)-irreducible parametrization will be defined by 



(27) 



yt + Aiyt-i + ... + Apyt-p = et + BiSt-i + ... + BpSt- 
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with all matrices, in principle, full. The word irreducible denotes the impossibility 
of reducing p. Thus, such parametrization includes all systems with maxjuj) = p 
and only them. 

From the properties of the canonical forms here exhibited, follows the conclusion 
that the image of the set of ARMA(p,p)-irreducible models in the space of systems 
is the union of a differentiable manyfold of dimension 2pm^ (corresponding to all 
systems with ni = ... = rim = p) with sets of lower dimensions (corresponding to 
systems with some Uj 7^ p) . For easier references, it is convenient to state the following 

Theorem 4.1. The generic sub-class of the ARM A (p,p) -irreducible 

parametrization is a differentiable manyfold of dimension 2prri?, where m is the 
number of components of {yt} . 

Proof: The generic sub-class of this parametrization is the one in which all 
matrices are full and rank[y4p, Bq] = m. But in this case the ARMA model is under 
the canonical form of systems with ui = ... = = p, which, as is known [Hj, is a 
differentiable manifold of dimension 2pm^. 

Q.E.D. 

The problem here considered is of knowing if that union of systems sets, that is, 
the complete image, constitutes a differentiable manifold (in this case, its dimension 
would be 2pm^). This will occur if the points (systems) of the non-generic sets 
were regular under any coordinates system, which would require that the set of 
tangent vectors at each point spanned a space of dimension exactly equal to 2pm^. 
Unfortunately, the following theorem shows that this is not the case: 

Theorem 4.2. The set of systems representable by the 

ARM A (p,p) -irreducible parametrization does not constitute a differentiable many- 
fold. 

Proof: Let s be a system representable by this parametrization with some Uj 7^ p. 
Let M(z) = J + Mz~^ with M[Ap,i?p] = O (null matrix). For instance, in the case 



above illustrated(m 


= 3 


,ni 


= 2,^2 




2,^3 = 


2 


), 
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" 




X X 
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X 




' 
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(28) M[Ap, Sp] 
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c 


_ 




X X 


X 
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X 




_ 
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" 1 





" 






a 










' 1 


az 


-1 




" 


(29) M{z) = 





1 





+ 




6 






Z-' 







1 + bz- 


-1 













1 






c 













cz 


1 




1 



As in the systems with some Uj 7^ p the extended matrix [Ap, Bp] is not 
full rank, there always is M with the properties above. In this case, the pair 
[M{z)A{z), M{z)B{z)] will also represent an ARMA(p,p)-irreducible model with the 
same p, for the same system s, whatever be the values of a,b and c. 

Let, now, }ig^{z), i=1...2pm^, be the tangent vectors at a system representable by 
an ARMA(p,p)-irreducible parametrization, where the free parameters were denoted 
by {6i, i = 1,2, ...}. Since H(z) can be factored in 

H{z) = A{z)-^B{z) 
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being A(z) and B(z) the polynomial matrices 



(30) 



A{z) = I + Aiz-^ + ... + Aj,z-P 



(31) 



it follows that 



(32) 



Biz) = I + Biz-^ 



BpZ~P 



Or, 



HeXz) = A-\z)BeXz) - A-\z)AeXz)A-\z)B{z) 



(33) HeXz) = A-\z) (Be^z) - AeXz)H{z)) 

Now, Bg.{z) and A0^{z) are constant matrices (consider, for instance, the case 
m=2, p=2):' 



yt + 


9i 02 
6*3 04 


yt- 


1 + 


' 0, 




Os 


yt~2 


= et + 


' Og ( 

011 


9l2 


£t-i + 


'013 
015 


014 
016 



A 



eAz) 



et-2 

Then, one has, for instance, 

0" 

1 
1 


The matrix H(z) is unique and can be written in terms only of the free parameters 
of the canonical form of s. 

However, if s is of the kind here considered (that is, such that some Uj 7^ p), it 
will have an infinite number of representations [w4,;B], all of them being ARMA(p,p)- 
irreducible with the same p, as already claimed, of the type 

A{z) = M{z)A{Z) 

B{z) = M{z)B{z) 

where A(z) and B(z) are the canonical matrices. 
This doesn't change H(z), since 
n{z) =A{z)-^B{z) = [M{z)A{z)]-^[M{z)B{z)] 
= A{z)~^M^\z)M{z)B{z) = A{z)-'^B{z) = H{z) 

But it does change A(z) in (jH^ and, so, H^., i=1...2pm^. In particular, it is 
always possible to define M(z) non-unimodular, maintaining the properties above 
(the case exhibited is an example: it suffices that 0), such that y[~^{z) , when 
right-multiplied by K^^{z) in (j33p introduce a free parameter (in this instance, b) 
in the denominator of Hg-. Then, any change in the value of this parameter will 
result in a tangent vector (a system) which is L.I. with the remaining ones. Indeed, 
calling 7 =(7i,72, •••,7,-) the vector of free parameters of M(z) which appear in its 
determinant, one concludes that the space spanned by the tangent vectors at s is of 
infinite dimension (thus, different from 2pm^), that is, 

diui{H0^{z)h eEJ-} = 00 
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The conclusion is that points s of the kind here considered (that is, those ones 
not belonging to the generic sub-class) cannot be made regular under any coordinates 
system. Thus, the union of all classes of systems represented by the ARMA(p,p)- 
irreducible parametrization is not a differentiable manifold. 

Q.E.D. 

Remark 2. :The systems belonging to the non- generic case are in a position 
qualitatively similar to the vertex of a two-sided cone: it is not a regular point of the 
surface, but tangent vectors are perfectly definable on it, the anomaly being that they 
span of the i?^ instead of just a plane 



5 Riemannian Metric Tensor 

Consider a parametrization (D,P) which defines a class of dynamical systems, so that 
the elements hij{z) of the transfer function matrix H(z) which defines a system be 
expressed as a function of 9 = {61,62, ■■■,0d), that is, of the local coordinates of one 
of the maps of the differentiable manifold defined by (D,P), whose dimension will be 
denoted by "d ". 

Theorem 5.1. The element (i,j) of the Riemannian metric tensor G of a 
dynamical linear system is given, in terms of its transfer function matrix H(z) and 
an adequate parametrization that expresses it as a function of a finite vector 6 G , 
by: 



9ij 



1 

27ri 



tr 



dH{z) OH 
d6i 



c 



86, 



z dz 



where i=\/—l , C is the unit circle centered on the origin of the complex plane 
and tr stands for trace. 



Proof: 

Consider a system s belonging to the manifold defined by a 
parametrization (D,P). Specifically, suppose that, in one of it's maps, s be 
represented by the vector of local coordinates 6 = { 61, 62,-.., 6d). Let 6 {vi) and 
6 (f 2) be two curves of M passing by s. Let also s'^ and s'2 be the two derivatives of s 
along the two curves, respectively. 

The Riemannian metric tensor G at s is, as already defined, the matrix such that 



6\'G6'2 = g{s'^,s'2) =< s[,s',>= tr 



where Hi and H| 



1=0 



0, 1, 2, 3, are the Markov parameters of the two tangent 



systems. 

By Percival's formula (see ^3], for the scalar case; the generalization used here is 
easily obtained), one has 



tr 



2\T 



1=0 



^Jjr [H,{z)HUz-')]z-'dz 
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where Hi (z) and H2 (z) are the transfer functions of the two tangent systems. 
Now, these transfer functions, because they refer to tangent systems along those 
curves (and recaUing that stabihty of the systems imphes convergence of the 
infinite series involved, thus justifying the commutation between summations and 
derivatives), are given by 

00 00 d 

dHjz) d9i ^ 
dOi dvk 



i=0 i=Q i=l 



=E^^f^k=l,2 



j=l 



where H(z) is the transfer function of s and |^ was abbreviated to 9'^^\ 
Substituting in the last integral, there results 



nl r\l 1 



tr 



c 



j=i 




z ^dz 



which specifies the components of vectors 6'i and 62 . 



where 9.^ 
and e'}^^ 



, i=l,2,...,d 
i=l,2,...,d 



The element gij of tensor G is obtained by 

where Cj = [0, 0, 1, 0, 0] with the " 1 " in the i-th position. 
Substitute, in the last integral, 9[ by and $2 by Cj. 
The summations become: 



E aH(z) n>(l) ^ dHjz) 1 dHiz-^) nf(2) ^ dHjz-^ 

dOn " dei "^^^^ ddn dOj 



n=l 

and there results 



n=l 





'dH{z) dHT(z-^)' 
ddi ddj 


z ^dz 









Q.E.D. 

Example 1. For the scalar ARMA(1,1) parametrization, 

Ut + CLUt-i = Ut + but-i, so that 9 = {9i, ^2) = {a, b), with a, 6 G (—1, 1) 

and a ^ b. The transfer function in this case is scalar, given by 

so that 



z+b 
z+a 



z+e2 



s-. = ^(t tr dHTjz-^) -1^ 

— J_ / ( 9 z+eA ( a z-^+eA ^-i 
~-2ni J ^ yaOi z+e2 ) \d0j z-^+e2 ) 
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Thus, 



2ni 



z+b z 



C 



U2 —821 



1 
27ri 



z+b 1 



C 



{z+ay z-'^+a 



Y—z ^dz 



ab-l 
(l-a2)2 



,22 



2ni 



1 1 ^-1 



C 



z+a z ^+a 



Y—Z ^dz 



l-a2 



As a result, the Riemannian metric tensor in this case is 



G 



4afc-(fe2+l)(a2+i) afe_i 
-2-1)3 (l-a2)2 



^a2-l) 



(l-a2)2 l-a2 

which agrees with the corresponding state-space case given in [1, pg. 222]. 



5.1 Overlapping Parametrizations 

Consider the case in which Hq = I (identity matrix) and jt and Ut G i?™", that is, 
r=m (number of inputs equal to number of outputs). There is no loss of generality, 
since all required is static redefinition of the inputs (or outputs) and the introduction 
of artificial dummy inputs (or outputs). 

Consider, now, the different iable manifold S„ of all systems of this kind with 
McMillan degree n fixed. For 1, it is not possible to cover it with a unique map, 
so that the global chart is made up of a set of maps, each one of them characterized 

m 

by a set of m natural numbers n^, i=l,2,...,m, with ^ 

— fT'- Call M„- ni,n2,...,nm 

the 

i=l 

corresponding map. Then, the coordinates of a system s described by this map are 
defined by the following procedure [TUj : 
Given the system's Hankel matrix 



n 



Hi H2 H3 
H2 H3 H4 



let H* be its i-th block of m lines (for instance, = [H2H3H4...]) and let 
hii, h2i, hmi be the lines (of infinite size) of H*. The fact that a system can be 
represented by the map M„; ni,n2,. -,nm means that the lines 

hii, hin^; /i2i,...,^2n2i ^mi, ■■■hmnm constitutc a base for the space of lines of 7i. 
Thus, the lines /ii{ni+i), ^m(n,„+i) can be written as linear combinations of them: 

m rij 

(34) hi{rn+l) = ^ ^ Otijkhjk 

j=l k=l 
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i=l,...,m. 

Now, call hij{k) the k-th element of line hjj. The 2mn numbers {aijk,k = 
1, ...,nj;i,j = l,...,m, and hij{k),i = l,...,m;j = l,...ni;k = l,...,p} are the 
system's coordinates according to map Mn- ni,n2,...,nm^ that is, the components 6i 
{i = l,2,...,2nm) of the coordinate-vector 6 which appear in the tensor's formula 
given by theorem 1. Let the first nm components of 6 be the {ajjfc} and the last nm 
ones be the {hij{k)}. 

Now, define the matrix K with n rows and m columns as in jlOj . 
Each element of this matrix is one of the coordinates hij{k). 



5.2 Tensor for the ARM A representation 

The autoregressive moving averages representation (ARMA) 

AoUt + Aiyt-i + ... + ApUt-p = BqUi + BiUt-i + ... + BpUt-p 
where yt,Ut G BJ^ , Aj and Bj are m x m matrices and whose 
transfer function is H(z)=A~^ [z) B{z) 
where A(z)= Aqz^ + Aiz^-^ + ... + Ap 

and B(z)= Bo^P + ^i^P^i + ... + 5p 

has an overlapping parametrization as defined in ^Hj in which 

p=max{nj} 

with the aijk already defined, and 
B(z) = A(z) + M(z)K 

where M(z) is a polynomial matrix whose entries are 

pseudo-derivatives (in relation to z) of the entries of A(z) (if f(z)=z™' + aiz'^~^ + 
... -|- Om , then f(^) {z) = z™-~^ + aiz™-~^~^ + ... + flm-fc is its pseudo-derivative of order 
k). 

With the definition of vector 9 given above, notice that matrices A(z) and M(z) 
are functions only of its first nm components, while matrix K is function only of its 
nm last ones. Thus, the following developments can be made: 



Hfz) = A-ifz)B(z) 



f^H{z) = A~\z 



a 



B{z) = ^,A{z) + §-,M{z)K + M{z)§-,K 



with 
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i-Mz) = i-M{z) =0 for i > nm 



for i < nm+1 



So, for i > nm, there results 



d 



H{z) = A- 



^B{z) - ^ 



A-\z)^B{z) 



^A-\z)M{z)^K 



And, for i < nm+1, 

^H{z) = A-\z) \^A{z) 



dM{z) ^ _ dA{z) 



H{z) 



^A-\z) [^K+'-^[I-H{z)]) 



The results above demand a separation in three cases, each one giving rise to a 
corresponding formula for the metric tensor. 

Let I={l,2,...,nm} and J={nm+l,nm+2,...,2nm} be index sets. 



Case 1: ijeJ 



27ri 



tr 



c 



A-^{z)M{z 



M^{z-^)A-^{z-^) 



z ^dz 



Case 2: i,jel 



g., = A. f^tr[A-\z) {^K + ^ [/ - H(z)]) . 

■ {k-'-^ + - H^i^-')] '-^) A-^{z-^)]z-^dz 

Case 3: iel and jeJ 



A-\z) (^ir + ^ [/ - H{z)]) ^M^{z-')A-^{z-^)] z-'dz 
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5.3 Tensor for the state-space representation 

The state space representation 

xt+i = Axt + But 
yt = Cxt + ut 

where Xt G i?" and Ut^yt G RJ^ 

has the overlapping parametrization defined in ^U]; where 

C is a matrix of zeros and ones, A is a sparse matrix in which the only non-nul 
variable entries are the aijk already defined and B = K (as defined above). 
Thus, the following calculations will provide explicit formulas for the tensor: 



H{z) = C{zl - A)-^B + I 
and f H(z) = f C(zI-A)-i5 
and, for i > nm. 



H(z)= J-C(zI-A)-i5 =C(zI-A)-i J-E 



whilst, for i < nm+1 

_d 

ae. 



J- H(z)= J-C(zI-A)-i5 =C i-izl - A)-^ 



B 



C{zI-A)-'§-{zI~A)-^ 



Again, this generates three formulas for the tensor: 

Case 1: i,j€J (see definitions of I and J in the preceding section) 



9ij 



1 

2-Ki 



tr 



c 



C{zl - A) 



-1 dB dB 



T 



dOi aej 



z-^i - Ay^c^ 



z ^dz 



Case 2: i,jel 
3^3 = ^ift^ 



c 



c{zi - Ay'§-{zi - A)-\z-'i - Ay^^{z-^i - Ay^c^ 

Case 3: iGl and jGJ 



z ^dz 



9ij 



1 

27ri 



tr 



C 



C{zl - Ay^HizI - A)-if^(z-i/ - A)-^C^ 



Remark 3. In the deterministic case, letting 6 i = [K] 



kl ' 



z ^dz 
nm + 1, ...,2nm, with 
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k = integer (i/m)-n+l and I = i mod(m), which amounts to naming 9i in lexicographic 
order in matrix K, one has f|^^— = O (null matrix) whenever i mod(m) ^ j mod(m) 
(x mod(y) meaning the rest of division of x by y). Thus, the display of the metric 
tensor formulas for the ARMA and state space representations shows that gij = for 
certain pairs (i,j), since in case 1 the expression H^^—T^ff'^j"; B=K) appears 
in both representations. 



5.4 Stochastic Metric Tensor 

Stochastic linear dynamical systems are here defined as those ones which, besides 
having an input channel for known vector sequences, are permanently being excited 
by a vector white noise, of which only the first and second moments are known, the 
sequence itself being unknown. Hanzon proposes an internal product for the tangent 
space of the manifold of these systems without known input, of fixed McMillan degree 
and equal number of components for the vectors of white noise input and colored noise 
output. The corresponding tensor formula is obtained here, based upon theorem 
5.1.1. A simpler formula is also presented, derived from another suggested internal 
product by that author. Detailed calculations for their applications to some simple 
examples are shown. 

A stationary time series will be considered here as a realization of a gaussian and 
stationary stochastic process {yt} . It will be supposed that y has zero mean, y G TZ^ 
and t =1,2,3,... , that is, time will be discrete. Thus, {yf} is totally characterized by 
its auto-covariance function 

(35) r, = E(rf^J,z = 0,1,2,..., 

where E(.) is the expectance operator over y 's probability distribution function 
and the superscript T stands for transposition. This stems from the fact that the 
joint probability density of a string of size I+l of the process, that is, of the random 
vector w^ = {yf,yf^^, -^yl+i) is given by (H, pg. 90): 

(36) f{w) = [(27r)'"(^+i) det GJ-i/'e-s-^^- 
where 

To Ti T2 ... r, 

rf To Ti ... r,_i 

Being {y} stationary, it follows that Fj = F^j. 

The density above presupposes that G be nonsingular, which is equivalent to w's 
components being linearly independent. 

There is a biunivocal relation between the autocovariance function of a stochastic 
process and its spectral density [7j. In this case, calling T(z) the spectral density, 
one has (^J, pg. 69): 



(37) 



G 
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(38) T{z) = J2 r.^-^ 

i=— oo 

Now, T(z) can always be factored as T(z)=H(z)RH"^(z~^), where H(z) is a rational 
and stable matrix, with stable inverse, and R is symmetric positive definite. 

Imposing, further, that H(z) be causal and Ho=I, there is a unique pair [H(z),R] 
correspondent to rational T(z) ([Oj, pg. 72). 

H(z) may be interpreted as the transfer function of a linear system. As a result, 
{y} will be interpreted as the output of a stable hnear system whose input is a 
unobserved white noise. 



Hanzon jl2j proposes an internal product for the manifold of stochastic systems 
with a common McMillan degree analogous to the deterministic case: 



(39) 



< si, S2 >= tr 



2\T 



i=0 



where Si e S2 are systems belonging to the tangent bundle of the manifold. 

This is a metric of the covariance system, whose Markov parameters are {r.^}. So 
the immediate extension of theorem 5.1.1 to the stochastic case is: 

Theorem 5.2. A stochastic Riemannian metric tensor can be obtained by the 
formula 



(40) 



9ij 



1 
27ri 



tr 



c 



dUjz) dU'^jz-^] 



z ^dz 



where: 



(41) 



1=0 



rj= E(yty^J,i = 0,1,2,..., are the covariances of the stochastic process 
{ytjgenerated by the stochastic system [{Hi},R] , 
tr stands for trace, 

the superscript T indicates matrix transposition, 
"i " is v/^and 

C is the unitary circle centered in the origin of the complex plane. 

A more convenient metric, which can be expressed directly in terms of H(z) and 
R, is the one induced by the internal product defined over the two-sided infinite 
sequence {Ti,i = —2, —1, 0, 1, 2, ...} (reminding that Fj = F^-, Vz G Z): 



(42) 



< Si, S2 >= tr 



2\T 



Hanzon (P^); Pg- 208) refers to this choice as an also quite attractive possibility. 
Defining T(z) = Yl'^_^^ ^iZ~\ it follows the 
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Theorem 5.3. A stochastic Riemannian metric tensor can be obtained by the 
formula 



(43) 



9ij 



27ri/ 

c 



tr 



dT{z) dT^ (.: 

~de7 



z ^dz 



where T(z) = Yl°l_^ ^iZ~'^ = H(z)RH^(z~^) is the spectral density of the output 
{lit} of the hnear dynamic system whose transfer function is H(z) and whose input 
is a white noise {£<} of covariance matrix R and zero mean. 



Proof: Percival's formula 



(44) 



tr 



2\T 



1=0 



-^^jtr [Ai{z)Al{z-^)\ z-^dz 



c 



which was the kernel of the proof of theorem 5.1.1, relates a summation of the 

oc 

sequence {Aj, i G N} to an integral of the z-transform A(z) = ^ A^z'^ . It can also 
be stated as 



(45) 



tr 



2\T 



-^^jtr [A^{z)Al{z-^)\ z-^dz 
c 



where A(z) is now defined by A(z) = ^ AiZ , relating, thus, the summation 

i=—oo 

of the sequence {74j, i e Z} to an integral of this last z-transform. 
The remainder of the proof is equal to that of theorem 5.1. 
Q.E.D. 

The relation between T(z) and U(z) is the following: 
(46) T{z) = U{z) + U^{z-^) - To 



5.5 Some examples of tensors for the stochastic case 

Consider, as examples, the following stochastic systems (all of them scalar, that is, 
yt,Xt,St e R, and, to simplify, R=l): 

5.5.1 In the state space representation 

(47) xt+i = Axt + Bet 



(48) 



yt = Cxt + Dst 
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In this case, the covariance R of St is a scalar and has a purely multiplicative 
effect on the tensor. 

Example 1 

To simplify, let R=l, D=0 and C=l. Then, the system reduces to 
(49) xt+i = axt + h£t 



(50) 



Vt = xt 



whose transfer function is 



(51) 



z — a 



So, its spectral density is 



(52) T{z) = H{z)RH' {z'^) 
The partial derivatives become 



z — a z ^ — a 



[—a + z) (—1 + az) 



da 



9T{z) ^ ofc 
db (z— a)(— 1+oz) 



Then, noticing that T(z)=T(z ^), 
9ii = ^A 



1 

■27ri 



aT{z) dT'^jz-^) 
da da 



l-2az+z^ 



C 

-2b 



{-a+zy(-l+azy 



dT{z) dTT(z-^) 
db db 



z~^dz 
z^^dz 
z~^dz 



1 

"27ri 



C 



{z—a){ — l+az) 



z ^dz 



912 = ^21 = 2- (/) tr 



912 = 921 



1 

'2-!ri 



C 



-2b 



c 



dT(z) &TT{z-^) 
da db 



z ^dz 



(z— a)(— l+az) 



h2^ l-2az+z'^ 
{z-aY{-l-VazY 



z ^dz 



Calculating the integrals, the following tensor is obtained: 



(53) 



2(2a4 + 7a2 + l)64 ^ab'''{o? ^2) 



4 



(a2 - 1)5 
a63(a2 + 2) 

[p? - 1)4 



-4 



a2 - 1)4 
6^(a^ + 1) 
(a2 - 1)3 _ 
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This is an interesting example, because it is also exhibited in Jzj, pg. 223, 
using the metric of theorem 5.2 (but through another method, involving Ricati and 
Lyapunov equations and not a complex integral. The tensor obtained in that work 
is: 



(54) 



G 



(9a2 + 1)64 



6- 



6- 



i63 



{a? - 1)4 (a2 - 1)3 

Notice that the denominators coincide with those of the tensor ()53|) , but not the 
numerators. The tensor can be found by theorem 5.2 by the following process: 
Expand the transfer function in power series 



(55) h{z) 



z — a 



hz'^il + az''^ + a^z'"^ + ...) = hz'^ + ahz'"^ + a%z' 



to obtain the system's Markov parameters 



(56) ho = 0, /ii = 6, /i2 = ah, = a b, hi = b 

The sequence of covariances {r^} can be found by: 



r, = Eivtvl,) = E 



X] HiEt-i ^ HjEt+k- 



i=0 



--E 



J2 HiSt-i Yj ^J+k-jHj 

i=0 j=0 



oo oo oo oo 

= HiE Hj = Y1,Y1, HiR5j-k,iH. 

i=oj=o i=oi=o 



J2 HiRHj^f, 



i=0 



where is Kronecker's delta and reminding that y* = ^ HiEt- 



Substituting the Markov sequence of the example, comes 

oo 

(57) Tfe = ^a^-i6V+'=-^ 



i=0 



Then, 



oo oo 



(58) 



2a^+'=-i)z-'= = -b^ 



{a? -l){-a + z) 



k=0 i=l 

Substituting in formula (^0)) of theorem 5.2, tensor is obtained. 
Notice that equation is satisfied: 
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(59) To = ^ = ^ a'-^b^a'"^ 



a' - 1 

1=0 1=1 



U{z) + U^{z-') - To 



-b' 



2 Z : fe2 \ / ^2 



62 

-z 



(^az-i){-a+z) = (see expression 



a2-l 



Example 2 

The next example is of the important innovations model: 

(60) xt+i = axt + bet 

(61) yt = xt + et 
The transfer function in this case is 

, / N z + b — a 

62 h(z) = — 

z — a 

The spectral density becomes 

— 1 — 1)7' —I— n 7 

(63) T{z) = h[z)h{z-^) = [z^b-a) 



z — a) (—1 + az) 
The result found by theorem 3 is: 

(-a^ + 7b'^a^ + 1 + - - Aba^ + 2b'^a^ + Aba + b^y 

b{a - 2a^ + - Aba^ + 2b'^a^ + Sba'^ + Ab'^a + b) 
(a2 - 1)4 

^ - 2a2 + 1 - Aba^ + 29a^ + Aba + 26^ 



(64) ^712 = 921 = 2 

922 = -2 



(a2 - 1)3 

To use theorem 5.2 semi-infinite metric, compute: 

h(z) = = ^ + ^ = l + -^ = l + bz-^ + abz'^ + a%z-^ + ... 

^ ' z—a z—a z—a z—a 

Thus, 

(65) /ii = a^-^6,z = 1,2,3,... 

(66) ho = 1 
The general expressions 



23 



become, for this case: 



For k = 1,2,3,.. 

oo oo 



i=0 i=l 



oo 



(67) Tu = Y,a'-\hfa'^^-' + a^-^h=-h- + 



1=1 



(68) To = 5^ + i/oi?i^o'^ = I ^"'iPfa'-' ) + 1 = + ^ 



i=l \i=l 

Then, ^(^) = Er=o Tfe^^' = Er=i T.^"' + To 



a2 - 1 



oo 



U{z) = Z -b ^^'-^+^- z-' + 



- (a2-l){-a+^) 

k=l 

Having U(z), it is enough to apply theorem 5.2 formula (jlOl), to get to the 
corresponding tensor: 



(69) = (a^ - 46a^ - 20^ + 3ba^ + a + Gab"^ + b) 



{a-iy{a + iy 

b 



{a - if {a + if 
-2a2 + 462 + 1 + + _ 4^3^ 



- '( 7T37 — ~rTF 

(a — 1) (a + 1) 



5.5.2 In the ARMA representation The ARMA equations is: 

(70) Aoyt + AiUt-i + ... + Apyt-p = B^ut + BiUt-i + ... + BpUt-p 
The univariate ARMA(1,1) case will be exhibited here. It is defined by 

Example 3 

(71) yt + ayt-i =et + bst-i 
whose transfer function is 
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The spectral density becomes 

(73) T{z) = h{z)h{z-') = ^\±1 ^{z + h) 

^ ' ^ ' ^ ' ^ ' z + a^ + a '{az + l){a + z) 

The following tensor is obtained by theorem 3: 

„ _ o -43b^a^-5b^+b^a^-13&^a'^-2a^-2b^a^+24b3a3_7a2-7b4a2-|-i6fea+l6feaa-l-64 
- ^ 

^-Uba - 8ba^ + ba^ - 2b^a^ + Ibb'^a^ + Sa^ - + 1 + 3^2 

(74) gu = -2 , , ,,4 



922 = 2 



(a^ - 1)^ 
- 4a2 - 1 - 2b^a^ + 8ba - 2P 



(a2 - 1)3 

The computations to obtain the tensor prescribed by theorem 5.2 are: 
h(z) = ^ = ^ + ^ 

^ ' z+a z+a z+a 

= -rr^ = 1 - az'^ + a^z-"^ - a^z'^ + ... 

z+a 1+az 

-±- = = 62;-^ - baz-'^ + fea^^^-^ + ... 

z+a 1+az ^ 

Thus, 

(75) h(z) = 1 - (a - 6)^-^ + a(a - b)z-^ - a\a - b)z-^ + ... 
So, 

(76) hi^a'-^{a-b){-l)\i = 1,2,3,... 

(77) /lo = 1 
Now, for 1,2,3,.. 

00 

Tk = Y. «'"^(« - 6)(-l)V+^"^(a - 6)(-l)*+'^ + a^-^(a - b){-lY 

i=l 

(78) r,^(a-6)(-l)^ ^V"r 

(79) To = - hfd-\-ir-'^ + 1 = ~ ^ 
Then, 



fc=i 

The corresponding tensor is 
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9ii = 

_ 3b^a^-ib^a^-4:a^b-7b^a^+24:a^b+24:b'^a'^-i9b^a'^-9a'^-9a'^b'^+20b''a+20ba-l-b'^-7b^ 
„ _ 3a^b-2a'^-4b'^a*-Ga^b+7a'^+nb'^a'^-15ba-6b^a+l+5b'^ 

Sa^ - 4:a% - 6a2 + 126a -1-462 

(80) 922 = ^,3, ^,,3 

(a — 1) (a + 1) 



6 Conclusions 

The differentiable manifold of systems of a common McMillan degree is a natural set 
for the state space representation, because this degree is the number of components 
of the state in the minimal representation, thus appearing explicitly in the models. 
It gives rise to the so-called overlapping parametrizations, which exhibit much more 
flexibility than the canonical ones, which is important for the numerical process of 
identification, given the possibility of changing of model even on-line, whenever a 
malconditioning is detected. 

For the ARMA representation, the establishing of overlapping 

parametrizations based upon classes of models representing systems with a 
common McMillan degree results in a clumsy class of mathematical models j4j, 
because this degree is not natural for that representation. Unfortunately, the natural 
integer - p, the minimum degree of the AR and MA polynomials - doesn't give 
rise, as shown, to a class of models whose image (the systems of a common p) 
is a differentiable manifold, thus not allowing the definition of a natural ARMA 
overlapping parametrization. 

A remedial solution could be to work with a rougher class of models (the 
ARMA(p,p)-irreducible ones), which would not be strictly identifiable for systems 
in which not all Kronecker indices are equal; since the set of systems in which all 
them are equal is generic, maybe this was not such a big handicap... 

The complexity of the Riemannian metric tensor formulas grow exponentially 
with the increase of systems dimensions. There are two ways of attenuating this 
problem: 

1) Computer languages for algebraic symbolic processing like MAPLE and 
MATHEMATICAL^ are sufficiently flexible to integrate numerical algorithms with 
symbolic ones, thus dispensing the need of manual transcription of formulas. 

2) Since the only systems treated in the theory here presented are the stable ones, 
the parameters that appear in the denominator of the transfer function are bounded 
by restriction such as being between -1 and 1. In this case, certain terms of higher 
order in the expressions obtained for the metric tensors can be despised without 
considerable loss of precision, thus reducing the complexity of the formulas. 

The availability of powerful computer packages for algebraic computation turns 
attractive the use of analytical formulas involving complex integrals and partial 
derivatives of polynomial matrices for the study of geometrical properties of spaces 
of linear dynamical systems. 
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Given the close relations between the Riemannian metric tensor, the Fisher 
information matrix, the covariance matrix of the parameters estimators and the 
Hessian matrix of some common objective functions used in parametric identification, 
the author believes that the results displayed in this article hold some relevance for 
the classical problem of linear dynamical systems identification. 
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